Abstract Generation Based On Rhetorical Structure Extraction
نویسندگان
چکیده
generation is, like Machine Translation, one of the ultimate goal of Natural Language Processing. However, since conventional word–frequency– based abstract generation systems(e.g. [Kuhn 58]) are lacking in inter-sentential or discourse-structural analysis, they are liable to generate incoherent abstracts. On the other hand, conventional knowledge or script–based abstract generation systems(e.g. [Lehnert 80], [Fum 86]), owe their success to the limitation of the domain, and cannot be applied to document with varied subjects, such as popular scientific magazine. To realize a domain-independent abstract generation system, a computational theory for analyzing linguistic discourse structure and its practical procedure must be established. Hobbs developed a theory in which he arranged three kinds of relationships between sentences from the text coherency viewpoint [Hobbs 79]. Grosz and Sidner proposed a theory which accounted for interactions between three notions on discourse: linguistic structure, intention, and attention [Grosz et al. 86]. Litman and Allen described a model in which a discourse structure of conversation was built by recognizing a participant’s plans [Litman et al. 87]. These theories all depend on extra-linguistic knowledge, the accumulation of which presents a problem in the realization of a practical analyzer. Cohen proposed a framework for analyzing the structure of argumentative discourse [Cohen 87], yet did not provide a concrete identification procedure for ‘evidence’ relationships between sentences, where no linguistic clues indicate the relationships. Also, since only relationships between successive sentences were considered, the scope which the relationships cover cannot be analyzed, even if explicit connectives are detected. Mann and Thompson proposed a linguistic structure of text describing relationships between sentences and their relative importance [Mann et al. 87]. However, no method for extracting the relationships from superficial linguistic expressions was described in their paper. We have developed a computational model of discourse for Japanese expository writings, and implemented a practical procedure for extracting discourse structure[Sumita 92]. In our model, discourse structure is defined as the rhetorical structure, i.e., the compound of rhetorical relations between sentences in text. Abstract generation is realized as a suitable application of the extracted rhetorical structure. In this paper we describe briefly our discourse model and discuss the abstract generation system based on it.
منابع مشابه
Interfaces of Macro and Microstructure in Academic Writing: The Case of Research Article Abstracts
Abstract Although flourishing research has been devoted to research on article abstracts, more studies are needed to unpack the relationship between rhetorical moves and their associated linguistic and rhetorical features (e.g., metadiscourse). To underpin this relationship, the current study analyzed a total of 60 research article abstracts written in English by two cultural groups in three di...
متن کاملArgumentative Classiication of Extracted Sentences as a Rst Step towards Exible Abstracting
Knowledge about the rhetorical structure of a text is useful for automatic abstraction. We are interested in the automatic extraction of rhetorical units from the source text, units such as Problem Statement, Conclusions and Results. We want to use such extracts to generate high-compression abstracts of scientiic articles. In this paper, we present an extension of Kupiec, Pedersen and Chen's (1...
متن کاملArabic Rhetorical Relations Extraction for Answering "Why" and "How to" Questions
In the current study we aim at exploiting discourse structure of Arabic text to automatically finding answers to non-factoid questions ("Why" and "How to"). Our method is based on Rhetorical Structure Theory (RST) that many studies have shown to be a very effective approach for many computational linguistics applications such as (text generation, text summarization and machine translation). For...
متن کاملThe Impact of Summary Writing with Structure Guidelines on EFL College Students’ Rhetorical Organization: Integrating Genre-Based and Process Approaches
This study aimed at investigating the impact of writing on Iranian EFL college students’ rhetorical organization. Thirty Iranian female undergraduate students majoring in English at Al-zahra University participated in the current study. The writing instructions included two stages, each lasting for four weeks. The participants were assigned to a control group and an experimental group according...
متن کاملThematic Progression in the Rhetorical Sections of an Online Iraqi English Newspaper
Abstract Thematic development refers to the way theme and rheme in the clause are developed. The theory of rhetorical structure can be defined as the strategies that follow specific ways to make writing more persuasive. The present study aimed to examine how Iraqi writers maintain cohesion in the text by analyzing the patterns of thematic progression in various rhetorical sections in an online ...
متن کامل